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AN EFFICIENT ESTIMATOR FOR LOCALLY STATIONARY 
GAUSSIAN LONG-MEMORY PROCESSES 1 

By Wilfredo Palma and Ricardo Olea 

Pontificia Universidad Catolica de Chile 

This paper addresses the estimation of locally stationary long- 
range dependent processes, a methodology that allows the statisti- 
cal analysis of time series data exhibiting both nonstationarity and 
strong dependency. A time-varying parametric formulation of these 
models is introduced and a Whittle likelihood technique is proposed 
for estimating the parameters involved. Large sample properties of 
these Whittle estimates such as consistency, normality and efficiency 
are established in this work. Furthermore, the finite sample behavior 
of the estimators is investigated through Monte Carlo experiments. 
As a result from these simulations, we show that the estimates behave 
well even for relatively small sample sizes. 

1. Introduction. Even though stationarity is a very attractive theoreti- 
cal assumption, in practice most time series data fail to meet this condition. 
As a consequence, several approaches to deal with nonstationarity have been 
proposed in the literature. Among these methodologies, differentiation and 
trend removal are popular choices. Other approaches include, for instance, 
the evolutionary spectral techniques first discussed by Priestley (1965). In 
a similar spirit, during the last decades a number of new time- varying de- 
pendence models have been proposed. One of these methodologies, the so- 
called locally stationary processes developed by Dahlhaus (1996, 1997), has 
been widely discussed in the recent time series literature, see, for example, 
Dahlhaus (2000), von Sachs and MacGibbon (2000), Jensen and Whitcher 
(2000), Guo et al. (2003), Genton and Perrin (2004), Orbe, Ferreira and 
Rodriguez-Poo (2005), Dahlhaus and Polonik (2006, 2009), Chandler and 
Polonik (2006), Fryzlewicz, Sapatinas and Subba Rao (2006) and Beran 
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(2009), among others. This approach allows the stochastic process to be non- 
stationary, but assuming that the time variation of the model is sufficiently 
smooth so that it can be locally approximated by stationary processes. 

On the other hand, during the last decades, long-range dependent data 
have arisen in disciplines as diverse as meteorology, hydrology, economics, 
etc., see, for example, the recent surveys by Doukhan, Oppenheim and Taqqu 
(2003) and Palma (2007). As a consequence, statistical methods for model- 
ing that type of data are of great interest to scientists and practitioners from 
many fields. At the same time, many of these long-memory data also display 
nonstationary behavior, see, for instance, Granger and Ding (1996), Jensen 
and Whitcher (2000) and Beran (2009). Nevertheless, most of the currently 
available methods for dealing with long-range dependence are incapable of 
modeling time series with these features. In particular, much of the theory 
of locally stationary processes applies only to time series with short mem- 
ory, such as time- varying autoregressive moving average (ARMA) processes 
and not to time series exhibiting both nonstationarity and strong depen- 
dence. In order to treat that type of data, this paper addresses a class of 
strongly dependent locally stationary processes. In particular, these mod- 
els include a Hurst parameter which evolves over time. Following Dahlhaus 
(1997), we propose a Whittle maximum likelihood estimation technique for 
fitting Gaussian long-memory locally stationary models. This is an extension 
of the spectrum-based likelihood estimator introduced by Whittle (1953). A 
great advantage of this estimation procedure is its computational efficiency, 
since it only requires the calculation of the periodogram by means of the 
fast Fourier transform. Additionally, we prove in this article that the pro- 
posed Whittle estimator is asymptotically consistent, normally distributed 
and efficient. Thus, this paper provides a framework for modeling and mak- 
ing statistical inferences about several types of nonstationarities that may 
be difficult to handle with other techniques. For instance, changes in the 
variance of a time series could be spotted by simple inspection of the data. 
However, variations on the dependence structure of the data are far more 
difficult to uncover and model. 

The remainder of this paper is structured as follows. Section 2 discusses 
a class of long-memory locally stationary processes and proposes a quasi 
maximum likelihood estimator based on an extended version of the Whittle 
spectrum-based methodology. The consistency, asymptotic normality and 
efficiency of these quasi maximum likelihood estimators are established. 
Applications of the asymptotic results to some specific locally stationary 
processes are also presented in this section. Proofs of the theorems are pro- 
vided in Section 3. Note that the techniques employed by Dahlhaus (1997) 
to show the asymptotic properties of the Whittle estimates are no longer 
valid for the class of long-memory locally stationary processes discussed in 
this paper. This difficulty is due to the fact that these processes have an 
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unbounded time-varying spectral density at zero frequency. Consequently, 
several technical results must be introduced and proved. Section 4 reports 
the results from several Monte Carlo experiments which allow to gain some 
insight into the finite sample behavior of the Whittle estimates. Conclusions 
are presented in Section 5 while auxiliary lemmas are provided in a techni- 
cal appendix. Additional examples and simulations along with a comparison 
of the Whittle estimator with a kernel maximum likelihood estimation ap- 
proach and two real-life applications of the proposed methodology can be 
found in Palma and Olea (2010). The bandwidth selection problem for the 
locally stationary Whittle estimator is also discussed in that paper, from an 
empirical perspective. 

2. Definitions and main results. 

2.1. Long-memory locally stationary processes. A class of Gaussian lo- 
cally stationary process with transfer function A can be defined by the 
spectral representation 

(1) Y t>T = ^ Al T (\)e iXt dB(X), 

J —n 

for t = 1, . . . ,T, where B(X) is a Brownian motion on [— tt,tt] and there is 
a positive constant K and a 27r-periodic function A : (0, 1] xR->C with 
A(u, —A) = A(u, A) such that 

(2) sup 

t,x 

for all T. The transfer function A® T (\) of this class of nontstationary pro- 
cesses changes smoothly over time so that they can be locally approximated 
by stationary processes. An example of this class of locally stationary pro- 
cesses is given by the infinite moving average expansion 




where {ej} is a zero- mean and unit variance Gaussian white noise and 
{i/jj(u)} are coefficients satisfying ^2'jLoipj{ u ) 2 < 00 f° r an u £ [0, 1]- In this 
case, the transfer function of process (3) is given by A® T (X) = 
cr(^) Sjlo^j(T) e ~ jAj = A), so that condition (2) is satisfied. The model 
defined by (3) generalizes the Wold expansion for a linear stationary pro- 
cess allowing the coefficients of the infinite moving average expansion vary 



<t(A) 



Mj!> X 



< 



K 
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smoothly over time. A particular case of (3) is the generalized version of the 
fractional noise process described by the discrete-time equation 



(4) Y t , T = -( * ) (1 - B)-^e t = *(|) f> (!) £ *-- 

for t = 1,...,T, where {et} is a Gaussian white noise sequence with zero 
mean and unit variance and the infinite moving average coefficients {i]j(u)} 
are given by 

where T(-) is the Gamma function and d(-) is a smoothly time- varying long- 
memory coefficient. For simplicity, the locally stationary fractional noise 
process (4) will be denoted as LSFN. 

A natural extension of the LSFN model is the locally stationary autore- 
gressive fractionally integrated moving average (LSARFIMA) process de- 
fined by the equation 

(6) *(i/T, B)Y tiT = a(t/T)G(t/T, B)(l - B)- d ^e u 

for t = l,...,T, where for u£ [0,1], $(«, B) = l + 4>\(u)B H h 4> P (u)B p 

is an autoregressive polynomial, @(u, B) = 1 + 9\(u)B + • • • + 9q(u)B® is a 
moving average polynomial, d(u) is a long-memory parameter, cr(u) is a noise 
scale factor and {et} is a Gaussian white noise sequence with zero mean and 
unit variance. This class of models extends the well-known ARFIMA process, 
which is obtained when the components B), &(u,B), d(u) and <j(u) 
appearing in (6) do not depend on u. Note that by Theorem 4.3 of Dahlhaus 
(1996), under some regularity conditions on the polynomial &(u,B), the 
model defined by (6) satisfies (1) and (2), see Jensen and Whitcher (2000) 
for details. 

2.2. Estimation. Let 6 € be a parameter vector specifying model (1) 
where the parameter space is a subset of a finite-dimensional Euclidean 
space. Given a sample {Yip, ■ ■ ■ , Yt,t} of the process (1) we can estimate 9 
by minimizing the Whittle log-likelihood function 

P> ^ = 5Fs£E{x*AKA) + ^f}*, 

where fe(u, X) = \ Ag(u, A)| 2 is the time-varying spectral density of the lim- 

A)| 2 

m 



iting process specified by the parameter 9, In{u, A) = ^g^^/o) ^ s a tapered 



periodogram with 



N-l , \ N-l / xfc 

D N (u,X) = h(^\Y [uT] „ N/2+s+hT e-* Xs , H KN = ^ h< 



N) 6 
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T = S(M -l) + N, uj = tj/T, tj = S(j - 1) + N/2, j = 1, . . . ,M and h(-) 
is a data taper. The intuition behind this extended version of the Whittle 
estimation procedure (7) is as follows: the sample {Yi,T> • ■ • >^t,t} is subdi- 
vided into M blocks of length N each shifting S places from block to block. 
For instance, if we split a time series of T = 652 observations into M = 100 
blocks of length N = 256 each, shifting S = 4 positions forward each time we 

get the blocks (Yi |652 , F 2|65 2, . . . , 1256,652), ■ ■ ■ , (^397,652,^398,652, • ■ • , *652,652)- 

Then, the spectrum is locally estimated by means of the data tapered pe- 
riodogram on each one of these M = 100 blocks and then averaged to form 

(7) . Finally, the Whittle estimator of the parameter vector 9 is given by 

(8) 6t = argmin£T(#)j 

where the minimization is over a parameter space 0. The analysis of the 
asymptotic properties of the Whittle locally stationary estimates (8) is dis- 
cussed in detail next. Before stating these results, we introduce a set of the 
regularity conditions. 

2.3. Assumptions. The first assumption below is concerned with the 
time-varying spectral density of the process. The second assumption is re- 
lated to the data tapering function and the third assumption is concerned 
with the block sampling scheme. It is assumed that the parameter space 
is compact. In what follows, K is always a positive constant that could be 
different from line to line. 

Al. The time- varying spectral density of the limiting process (1) is strictly 
positive and satisfies 

fe(u,\)~C f (0,u)\\\- 2d eM, 

as |A| — > 0, where Cf(8,u) > 0, < mfg tU de(u), supg u dg{u) < \ and dg{u) 
has bounded first derivative with respect to u. There is an integrable function 
g(X) such that | Vg log fg(u, A) | < g(X) for all 9 G 0, u G [0, 1] and A G [-vr,7r]. 
The function A(u, A) is twice differentiable with respect to u and satisfies 

I A(u, X)A(v, -A) exp(ifcA) dX ~ C(9, u, 

J —TT 

as k — > oo, where \C(9,u,v)\ < K for u,v G [0, 1] and 9 G 0. The function 
fe(u, A)" 1 is twice differentiable with respect to 9, u and A. 

A2. The data taper h(u) is a positive, bounded function for u G [0, 1] and 
symmetric around ^ with a bounded derivative. 

A3. The sample size T and the subdivisions integers N, S and M tend to 
infinity satisfying S/N -> 0, VTlog 2 N/N -> 0, y/T/M -> and iV 3 log 2 N/ 
T 2 ^0. 
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Example 2.1. As an illustration of the assumptions described above, 
consider the extension of the usual fractional noise process with time- varying 
Hurst parameter, described by (4) and (5). The spectral density of this LSFN 
process is given by 

a 2 / y \ -2dfl(u) 



fe(u,\) = — ( 2 sin 

Note that this function is integrable over A E [— vr,7r] for every u E [0, 1] as 
long as dg{u) < ^ for all u E [0,1] and 9 E 0. Furthermore, we have that 

fo(u,X) ~ T^\\\~ 2de{u \ as A -> 0. By assuming that \Vgd e (u)\ < K, we have 
that |Vfllog/ e (u,A)| = |V e d e (V)||log(2sinf ) 2 | < K|log|A||, which is an inte- 
grable function in A E [— vr,7r]. In addition, from (5) the function A(u,X) of 
this process satisfies 

T[l -dg(u)-dg(v)]T[k + dg(u)) 



A(u,X)A(v,-X)exp(ikX)dX , 

T[l - do{u)\Y[dg{u)\Y[k + 1 - d g (y)\ 

for k > 0. Thus, by Stirling's approximation, we get 

A(u, X)A(v, - A) exp(^A) dX ~ ^ = I ^| fe^W^W^ , 

for —7- oo. Besides, a simple calculation shows that fg(u, A) -1 is twice dif- 
ferentiable with respect to u and A as long as dg(u) is twice differentiable 
with respect to u. Thus, under these conditions the time-varying spectral 
density fg(u,X) satisfies assumption Al. On the other hand, an example of 
data taper that satisfies assumption A2 is the cosine bell function 

(9) /i(» = §[l-cos(27ra)]. 

Note that if S = 0{N a ) and M = 0{N b ) then T = 0{N a+b ) for a + b > 1. 
Thus, by choosing exponents a and b such that (a, b) E C = {a < 1, | < a+b < 
2,a< b}, assumption A3 is fulfilled. Observe that the C is a nonempty set. 



2.4. Main results. Some fundamental large sample properties of the Whit- 
tle quasi-likelihood estimators (8), including consistency, asymptotic normal- 
ity and efficiency are established next. In addition, we establish an asymp- 
totic result about the estimation of the time- varying long-memory parameter 
for a class of locally stationary processes. The proofs of these four results 
are provided in Section 3. 

Theorem 2.1 (Consistency). Let 9q be the true value of the parame- 
ter 6. Under assumptions Al-A3 ; the estimator 9t satisfies 9? — > 9q, in 
probability, as T — > oo. 
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Theorem 2.2 (Normality). Let 9q be the true value of the parameter 9. 
If assumptions A1-A3 hold, then the Whittle estimator 9t satisfies a central 
limit theorem 

VT(0 T -6 o )^N[O,r(9 o )- 1 }, 
in distribution, as T — > oo, where 

(10) r(0) = ^ J J* [V log f e (u, A)] [V log f e (u, A)]' dX du. 

Theorem 2.3 (Efficiency). Assuming that conditions A1-A3 hold, the 
Whittle estimator 9? is asymptotically Fisher efficient. 

Remark 2.1. Recall that for astationary fractional noise process FN(cZ), 
the asymptotic variance of the maximum likelihood estimate of the long- 
memory parameter, d, satisfies 

lim TVar(d) = -=■. 

On the other hand, suppose that we consider a LSFN process where the long- 
memory parameter varies according to, for example, d(u) = ao + ot\u. Thus, 
in order to estimate d(u), the parameters ao and ot\ must be estimated. Let 
So an d Si be their Whittle estimators, respectively, so that d(u) = So + Sin. 
According to Theorem 2.2, the asymptotic variance of this estimate of d(u) 
satisfies 

~ 24 
lim TV&r[d(u)] = — (1 - 3u + 3u 2 ), 

and then integrating over u we get 

f 1 ~ 12 
lim T / Var\d(u)] du = — 

T->oo J Q TT 2 

Since two parameters are being estimated, on the average, the asymptotic 
variance of the estimate d(u) is twice the asymptotic variance of d from 
a stationary FN process. This result can be generalized to the case where 
three or more coefficients are estimated and to more complex trends, as 
established on the following theorem. 

Theorem 2.4. Consider a LSFN process (4) with time-varying long- 
memory parameter dp{u) = 2~Zj=i Pj9j( u )> where {gj(u)} are basis functions 
as defined in (12) below. Let d(u) = Yl^=i ^j9j( u ) ^ e the estimator of dp(u) 
for u G [0, 1] . Then under assumptions A1-A3 we have that 

(11) lim T [ \ai\d(u)}du = %. 
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Remark 2.2. Note that according to Theorem 2.4 the limiting average 
of the variances of d(u) given by (11) does not depend on the basis functions 
9j(-) for j = l,...,p. 

2.5. Illustrations. As an illustration of the asymptotic results discussed 
above, consider the class of LSARFIMA models defined by (6). The evolution 
of these models can be specified in terms of a general class of functions. For 
example, let {gj(u)}, j = 1, 2, . . . , be a basis for a space of smoothly varying 
functions and let dg(u) be the time- varying long-memory parameter in model 
(6). Then we could write dg(u) in terms of the basis {gj(u)} as follows: 

ft 

(12) l[d e (u)}=Y,u J g J (u), 

for unknown values of k and 9 = (ao, ot,\, ■ ■ ■ , ctft)', where £(•) is a known 
link function. In this situation, estimating 9 involves determining k and 
estimating the coefficients ao, • ■ • Important examples of this approach 
are the classes of polynomials generated by the basis {gj(u) = vP}, Fourier 
expansions generated by the basis {gj(u) = e mj } and wavelets generated 
by, for instance, the Haar or Daubechies systems. Extensions of these cases 
can also be considered. For example, the basis functions could also include 
parameters as in the case {gj(u) =e lu ^ j }, where {/3j} are unknown values. 

In order to illustrate the application of the theoretical results established 
in Section 2.4, we discuss next a number of combinations of polynomial and 
harmonic evolutions of the long-memory parameter, the noise variance, the 
autoregressive and moving average components of the LSARFIMA process 
(6). Additional examples are provided in Section 2 of Palma and Olea (2010). 



Example 2.2. Consider first the case P = Q = in model (6) where 
d(u) and a(u) are specified by 

p <? 
h[d(u)} = J2a j g j (u), € 2 [<7(«)] = ^^^(n), 

3=0 j=0 

for u£ [0,1], where ^i(-) and ^(") are differentiable link functions, gj(-) and 
hj (•) are basis functions. The parameter vector in this case is 9 = (ao, . . . , a p , 
/3q, . . . , (3 q )' and the matrix T can be written as 
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where 



~6~ 



1 9i{u)9j{u)du ' 

WMuW . 

1 hi(u)hj(u) du 
o [a(u)£' 2 (a(u))} 2 



i,j=0,. 



i,j=0, 



Example 2.3. As a particular case of the parameter specification of the 
previous example, consider the case P = Q = in model (6) where d(u) and 
cr(u) are both specified by polynomials, 

d(u) = ao + a±u + • • • + a p u p , a(u) = /3q + ftiu + • • • + /3 q u Q , 

for u G [0,1]. Similar to Example 2.2, in this case the parameter vector is 
8 = (oq, . . . ,a p ,(3o, . . . ,/3 g )', £i(u) = £2(11) = u and the matrix T given by (10) 
can be written as in (13) with 



7T 



6(i + j + l) 
1 



r, 



i,j=0,...,p 

u l+ ^ du 



o (A) + /3m + • • • + /V?) 5 



i,i=0,. 



The above integrals can be evaluated by standard calculus procedures; see, 
for example, Gradshteyn and Ryzhik [(2000), page 64] or by numerical in- 
tegration. 

Example 2.4. Considering now a similar setup as Example 2.3 with 
p = q = 1, but with link function £(■) = log(-) such that 

log[rf(it)] = a + aiu, log[a(n)] = f3 + fan, 

for u G [0, 1]. Then T can be written as (13) with 

ir 2 e 2ao [ 2a 2 (e 2ai -l) ai((2«i - l)e 2ai + 1) 

T 4ff[ [ai((2ai - l)e 2ai + 1) (2a 2 - 2ai + l)e 2ai + 1 

2 1 
1 2/3 



Example 2.5. Following with the assumption P = Q = in model (6), 
consider that d(u) and cr(u) are defined by the harmonic expansions 

d(u) = ao + a\ cos(Aiu) + • ■ • + a p cos(X p u), 
a{u) = /3o + Pi cos(ujiu) H h /3 q cos(uj q u), 
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for u G [0, 1], where Ao = 0, Xf A| for all i,j = 0, . . . ,p, j, loq = and 
ojf 7^ u)j for alH, j = 0, . . . , q, i 7^ j. For simplicity, the values of the frequen- 
cies {A,} and {ojj} are assumed to be known. As in Example 2.3, in this 
case the parameter vector is 9 = (ao, ■ ■ ■ , a p , Pq, . . . , f3 q )' and the matrix T 
appearing in (10) can be written as in (13) with 



and 



7T _ 

12 



sin(Aj — Xj) sin(Aj + Xj) 



Xi — Xj 



Xi + A,- 



i,j=0, 



r, 



7T 

12 



sin W; 



+ 



US; 



sin(oj.j + oj j 

UJi + OJj 



i,j=o,-,q 



Example 2.6. Consider now the case P = Q = 1 in model (6) where 
a(u) = 1 and d(u), Q(u,B), @(u,B) are specified by 

d(u) = a±u, 
$(u,B) = 1 + (j>(u)B, (f>(u) = a 2 u, 
G(u,B) = l + 6(u)B, 9(u) = a 3 u, 

for uG [0,1]. In this case, the parameter vector is 9 = (an, ctz, 0:3)', with 
< a,\ < 2, \oLj\ < 1, j = 1, 2 and the matrix T from (10) can be written as 



where 



711 



713 



722 



723 



with g(x) 
(-1,0). 



r 



1 l + ai 
2af °^ 1 — a\ 

1 

2a7I 

1 . l + a 2 
2^f l0g — 



711 712 713 
721 722 723 
731 732 733 



— . 712 



a 



g(Qi« 2 ) 1 

(aia^) 3 / 2 ai«2 





"1 1 " 




1 " 


log(l + ai)| 


{ 










2 ~ o7_ 









1 

2a^ 



«2 
1 



2 ' 



log(l + a 2 ) 



1 

a?2 



733 



7T 

18 : 



arctanh(y / x) for x G (0, 1) and g(x) = arctan(- v /— x) for x G 
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3. Proofs. This section is devoted to the proof of Theorems 2.1-2.4. Be- 
fore presenting the proofs of these results, we introduce and prove three 
useful propositions which are of independent interest. These propositions 
involve the large sample properties of the functional operator defined next. 
Consider the function (f>: [0, 1] x [— ir,ir] — > M and define the functional oper- 
ator 

(14) J(4>)= / / (/>(u,\)f(u,X)d\du, 

JO J-n 

where f(u,X) is the time- varying spectral density of the limit process (1). 
Define the sample version of J(-) as 

M 

(15) J ^) = T?E/ Hu j A)lN(u„X)dX, 

3=1 J -* 

where M and Uj, j = 1, . . . ,M are given in Section 2. Furthermore, define 
the matrix 

(16) Q(u) = ( ^ 4>(u, X)e iX( - s ~V dX 

' s,t=l,...,N 

and the block-diagonal matrix Q(4>) = diag[Q(ui), . . . , Q(um)]- For nota- 
tional simplicity, sometimes in what follows we have dropped 8 from dg(u) 
so that it becomes d{u). 

Remark 3.1. Since the function A(u, X) and the spectral density f{u, X) 
of a locally stationary long-memory process are unbounded at zero frequency, 
the techniques used next to prove the large sample properties of J{4>) and 
the quasi-likelihood estimators are different from those used in the short- 
memory context. For instance, the function A{u, X) does not satisfy the key 
assumption A.l of Dahlhaus (1997) or the coefficients of (3) fail to 

meet conditions (2) and (3) of Dahlhaus and Polonik (2009). Due to the 
unboundeness of f(u,X) at the origin, our proofs exploit the properties of 
the Fourier transforms 

fM= r f(u,X)e iX dX, 

J —IT 

f(u,v,-):= [ A(u,X)A(v,-X)e iX - dX. 



3.1. Propositions. 



Proposition 1. Let f(u,X) be a time-varying spectral density satisfy- 
ing assumption Al and assume that the function (ft(u, X) appearing in (14) 
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is symmetric in A and twice differentiable with respect to u. Let f(u,k) and 
4>{u, k) be their Fourier coefficients, respectively. If there is a positive con- 
stant K such that 

\f(u,k)${u,k)\<K ( 



k 2 J' 

for all u £ [0, 1] and k > 1, then, under assumptions A2 and A3 we have 
that 

E[J T ^)) = J( (t) ) + o( l ^P\+o( 



N \M 



Proof. From definition (15), we can write 

M 



E[J T {4>)] = TfJ2 r ^ j ,X)E[I N (u j ,X)]d\ 

H^,\)E\D N ( U:i ,\)\ 2 d\ 



2irMH 2 

' J- 



c(u,t,s) = E(Y[ uT ^ N / 2+t +l,TY[uT]-N/2+s+l,T)- 
1 



where 

Thus, 
£[J T (0)] 



27rM# 2jA r(0) 

j=lt,s=0 V 7 V 7 ^ 



M JV-1 



^ JW iv — x / t \ / S \ 

j^oj E E *( 77 j h U J c( "'' i- * , * ( "' ' ' " 



x <j>(uj t k)(2-6 k ), 
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where 6 k = 1 for k = and 5 k = for k ^ 0. By assumption A2 and Taylor's 
theorem, 



, t k 
hl N + N 



k 



for some £ tA N e (j?, t ^r), for k > 0. Thus, 



(17) 



M N-lN-t / f \ 

ElJ ^ ] = ^mh^W) EE E " 2 U > ( "" M+ k) 

Z ' JV v ' j=l t=0 fc=o v 7 

x 4>( Uj ,k)(2 - 5 k ) 
- M N-lN-t , v 

yTTTo) EEE A b ^(e tl fc,iv)c(^ , t, t + A;) 

Z ' JV v ; j=i t=0 fc=0 v 7 



M N-lN-t 

2nMH 2 ~ 

x (f)(uj,k)(2 - 5 k ) 

2irMH 2 ~ 

- ' j=l t=u fc= 

x (j)(uj,k)(2 - 5 k ). 

Under assumption Al, we can expand c(u,t,t + A;) by Taylor's theorem as 
c(u, i, i + A) 

= f(u,k) + f(u,k)cpi(u,k)\ 



t + l-N/2\ 
T J 



A 



+ f(u,k)^ 2 (u,k)[ - +R(u,t,k,N,T), 



where 



<pi(u,k) 
C\(9, u, u) 

ip 2 (u,k) 
C 2 (9,u,u) 



Ci(9,u,u) 
C{9,u,u) 

dC(9,u,u + v 



+ 2d'{u) log A;, 



du 

C 2 (9,u,u) 
C(9,u,u) 

dC(9,u,u + v 



v=0 

+ d'(u) log A, 



Ov 



v=0 



d'{u) = 
is given by 



, C(9, u, v) is defined in assumption Al and the remainder term 



R(u,t,k,N,T) = 0{ f(u,k) 



( ? 



loK 2 k\. 
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Thus, since by assumption Al < K for all u £ [0, 1], we have \ fj(u, i 

K log k for j = 1,2 and k > 1 . Now we can write 

N-t 



k=0 



c(uj,t,t + k)4>(uj,k)(2 — 5k) 

N-t 



k=0 
N-t 



(18) + £ fiuj^Muj, k)M^,k)(2 - 5 k )( t + 1 N/2 \ 

k=o ^ ' 



fc 



+ ^ f( u j^ k )H u j^ k ) i P2(j,k)(2 - 8 k ) 

k=0 
N-t 

+ ]T i2(u i5 t, k, N, T)0( Uj ,k){2 - 5 k ) 



k=0 



Since by assumption \ f(u,k)4>(u,k)\ < Klogk/k 2 , for k > 1, uniformly 
it 6 [0, 1], we conclude that there is a finite limit -A(it) < oo such that 



N 

A{u)= lim y)/(«,fc)0(«,fc)(2- < y fc ). 



fc=0 

Consequently, 

JV-l / , \ AT-i 



/v — i / j. \ - /v— c 

^ ' k=0 
N-l / f \ N- 1 

E^fiv) Eft^fe'W 2 - 5 *) 

t=o ^ ' fc=0 

AT-l / j. \ W-l 

EMjv) E /K-,fc)0(«j J fc)(2-*fc) 

oo 



t=0 K / k=N-t+l 
N-l 



t=o v 



+ 0(log 2 iV), 



k=N 
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by Lemma 7. Hence, 



N-l / f \ N ~ t 

E h * ( n ) E /(«J, *)(2 " S k ) 



N 

A( Uj ) Y, h 2 
t=o 

+ 0(log 2 N) 



fc=0 
N-l 



i=0 ^ ' fc=7V 



But, 



k=N 

and consequently, 

JV-i 



< 



log k ( log iV 



A- 2 



t=0 

Therefore, 
N-l / , \ N—t 



t=0 ^ ' fc=iV 



O(logiV). 



JV-1 . v JV-i ^ JV_i 

E ^ [m E *)(2 - «*) = A( Uj ) Yh 2 [-)+ 0(log 2 iV). 

t=0 ^ ' fc=0 t=0 ^ ' 

On the other hand, by analyzing the term involving the second summand of 
(18) we get 

N-i / s JV-t 

E M jy) E^^'^^i'^^J'^^ 2- ^) 

t=0 ^ ' fc=0 



JV-1 



E" 2 (^ 



i=0 
iV-1 



JV-1 



^ 991 (itj , , , fe) (2 - S k ) 



k=0 



N-l / * \ N ~ l 

+ E h2 1 7v ) E k )H u j, k )T( u j, k ) 

+— n \ / !■ — w_+_li 



t=0 
'JV-1 



fc=JV-m 

t \ ft+l-N/2 



t + 1- N/2 
T 



i+l-iV/2\ 
T J 



T 



'N-l 



Y Vi( u j, k )f(uj,k)(j)(u j ,k)(2 - 5 k ) 



k=0 



+ 



N\og 2 N 
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by Lemma 8. Now, since h(-) is symmetric around 1/2, we have 



N-i 
t=o 



N ) { T~ 



O 



Besides, | Ej£oVi (uj , k)f(uj , k)4>(uj ,k)(2-5 k )\<K £^ =1 
sequently, 



< oo. Con- 



JV-l , s N-t 

Y ^ 2 ( ]v ) Y k )f( u j' k )H u j, k )( 2 - fa) 

t=0 ^ ' fc=0 



t + l-iV/2\ 



r 



= o(^log 2 iv). 
The third term of (18) can be bounded as follows: 



N-t 



^2ip 2 {u j ,k)f(u j ,k)^(u j ,k)k 



k=0 



N 



k=l 



and then 



JV-i 



Yl h2 ( ) Yl ^ 2 ^' k )f( u 3> k)0(uj,k) 

t=0 ^ ' k=0 

The last term of (18) can be bounded as follows: 

N-t 



N-t 



T 



R{uj ,t,k,N, T)<j>{uj ,k)(2-5 k ) 



k=0 

and then 

N-lN-t 



<K^log 2 N. 



N 



<Klog'N{ - ) , 



iV — ± I\ — L / v 

E E ^ ( n ) *' ^ T) ^' ' fc) (2 " 6k) 

t=0 k=0 ^ ' 



N 3 n 

<K^log 2 iV. 



Note that by assumption A3, the term above converges to zero as N, T — > oo. 
Therefore, the first term in (17) can be written as 



M N-l , . 
J^)Y S fc 2 ^Jc(« i ,t,t + /»)0(« i ,fe)(2-* fc ) 

1 ^ „, , /log 2 iV\ 
— > A(ui) + 0[ — . 

.7=1 V 7 



2itMH 2 
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Now, by Lemma 1 we can write A(u) = 2ir J 6(u,u})f(u,uj)du, and then 



(19) 



1 M ( 1 \ 

2ttM ^ V 3! VW \MJ 



On the other hand, the second term in (17) can be bounded as follows: 

\c(uj,t,t + k)<fi(uj,k)(2 — Sk)\ 



TV 



+ — \f{uj,k)(j}(uj,k)(p2{uj,k)\ + ^2 log N\f(uj,k)<j>(uj,k 

Since |(^j(uj, /c)| < ETlog/c for i = 1,2, j = 1, . . . ,M and A; > 1, we conclude 
that 

iV log* 

|c(« i ,t,t + fc)0(u i ,fc)(2-<5 fc )| <K—-j^-. 



Therefore, since |/V(it)| < if for u <G [0, 1] by assumption A2, we have 
N—t 

:,k,N 



^2 c{uj,t, t + k)(j>(uj,k)—h'(£ t ,k 

fc=0 

Consequently, 

JV-l / , \ N-t 



c K^logk^ K 



log 2 iV 



T ^ k 

k=l 



/ t \ ^ k 

t=o ^ ' fc=0 



log 2 iV 



Hence, the second term of (17) is bounded by K (log 2 N)/T. From this and 
(19), the required result is obtained. □ 

Proposition 2. Let f(u, A) be a time-varying spectral density satisfying 
assumption Al. Let 0i, 02 : [0, 1] x [— 7r,7r] — )■ K 6e too functions such that 
4>i(u,X) and 4>2(u,X) are symmetric in X, twice differ entiable with respect 
to u and their Fourier coefficients satisfy \4>i(u,k)\, \4>2{u,k)\ < K\k\~ 2d ^~ l 
for u£ [0, 1] and \k\ > 1. Lf assumptions A2 and A3 hold, then 

lim T cov[Jt(4>i), Jt{4>2)] = 47r / / 4>\{u, X)4>2{u, X)f(u, A) 2 dXdu. 

Proof. We can write 

rcov[J T (0i),J T (02)] 
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M 



4 Z / / M u j>*)M u k,ij) 



x cov[ijv(ttj, A), I^{uk, A*)] d\ dfj,. 

But, 

cov[I N (uj, X),I N (u k ,fJ.)] 

cov(\D N (uj,\)\ 2 ,D N (u k ,n)\ 2 ) 



[2itH 2 , n {0)} 



2 



N-l 



x e i M s -*)+ i M" 5 '~p) 

X COv(Y[ UjT ]_ A r/2+s+l,T^ / [t tj T]-Af/2+i+l,r) 

Y [u k T] -N/2+p+l,TY[u k T] -N/2+m+l,T)- 

Now, an application of Theorem 2.3.2 of Brillinger (1981) yields 
cov[I N (uj,\),I N (u k ,fj,)] 
1 

" [2^ 2iiV (0)] 2 

* E ^)^>(£>(|>" ( --~ ) 

t,s,p,m=0 \ /\ /\ /\ / 

X {cOv(Y[ Uj . T ]_jv/2+t+l,T) ^ / [u fe T]-7V/2+m+l,T) 
X COv(Yj Uj .T]_jv/2+s+l,T, ^ / [« fc T]-Ar/2+p+l,T) 
+ COv(Yj Uj .T]__/v/2+t+l,T, 5 / [« fc T]-AT/2+p+l,T) 
X COv(Y[ UjT ]_ N / 2+s+ i,T,Y[u k T]-N/2+p+l,T)} 

1 



[2^^ 2 ,iv(0)] 2 

X ^^ fc _7V/2+l+.,T( 2; ) /l (^7)^-^ 

x [A t . _ N/2+1+ . tT (y)h(j^j ,—y — X 



Thus, 



where 
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x H N ^A° tk _ N/2+1+ . T (y)h(^ — ^,y + f?J 
x e iX{s - t)+if " {m " p) dxdy 

H N\A t ._ N/2+1+ . T (x)h(^\x-x 



[2irH 2 , N (0)] 




7T J —7V 



x H NyA^ N/2+1+ . T {x)h\^—yx + n 

x H N (Al_ N/2+1+ . T (y)h(— \-y- A 

x H N (^ h _ N/2+1+ . tT (y)h(—\y-fi 
x e iX{s - t)+i ^ m - p) dxdy. 



(20) rcov(Jr^i), JrOfc)) = ^n^ i B N + B n\ 



[2irMH 2 , N (0)}< 



M , 

- V / Mu j A)Muk,^H N [A 



b n = Z_> \_<l>i{u h \)<t>2(u k ,n)H N [ A t ._ N/2+1+ , T (x)hl — ),A 



x H N yA° tk _ N/2+1+ , T (x)h (^—Xx-^H. 
x f^t,-jv/2+i+.,T(y)^(]y).-f- A 



x ^( A ? fc -Jv/2+i+,T(y) /i (^)>y + / x ) 



with 11= [— vr,7r] 4 , and 



^ = E ^l( U i' A )^2K^)^A r ^°_Ar/2+l+-,r( x ) /l (]Y)' A " X 
X ^(^-^/2+l+-,T( X ) /l (^)' X + / i ) 

x ^ivM? 3 ._iv/2+i + .,T(y)^(^Y-y- A 
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x H N\^ k -N/W+. t T(v) h (jj)>V-l*) 
x e i{x+y) ^- tk) dxdydfidX. 



The term 13$ can be written as follows: 

M r 



(21) 



x A(uj,x)A(uk,-x)A(uj,y)A(u k , -y) 

x H N {\ - x)H N {x - n)H N (^ + y)H N (-y - A) 

x e i(x+v) ^- tk) dx dy dXdfi + R N 

M 

/ 4>i(uj,x)A(u j ,x)A(u k ,-x)4>2(u k ,y)A(u j ,y) 



j,k=i 



n 



x A(u k , -y)H N (X - x)H N {x - ^)H n {^l + y) 
x H N {-y - \)e i{ - x+y) ^- tk) dxdy dXd^i 



+ + + R N , 



with 



A (i) 



M „ 

V] / [^(^ji A) — 0i (uj, x)]</>2('!Xa ; , fj,)A(uj, x)A(u k , —x)A(uj , y) 



j,k=l 



x A(*u fe , -y)H N (X - x)H N (x - fi)H N (fi + y) 
x H N {-y - \)e i{x+y){t i- tk) dxdydXdfi, 

M f 

A iv = / ^i( u i. s )[^(Mb/i)-'/ , 2K,'/P(«i^)^(ttfc,-a;)^(«i,«/) 



i,fc=l 



II 



x A^fc, -y)H N (X - x)H N (x - fJ,)H N (fj, + y) 
xH N {-y- X)e i( - x+y ^ ~^ dx dy dX dfi, 
and by Lemma 2 the remainder term Rn can be bounded as follows: 

AI r 

/ 4>i{u j ,X)(f)2(u k ,Li)A(u j ,x)A(u k ,-x) 



\Rn\ < J 



j,k=V 



xA( Uj ,y)y- d ^L N (y + ti) 

(22) 
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X Hn(\ — x)Hn(x — fi) 

x H N (-y - A)e i(x+y)( ^-* fe) dxdydXdn 
By integrating with respect to n the term can be written as 

AI N-l 



A I 1 t c — \ / \ / 



j,k=lt,s=0 

P7Y P7T /*7T 

x / / / [(/>i(uj,X) - 4>i(uj,x)]A(uj,x)A(u k ,-x) 



x A(uj,y)A(u k ,-y)H N (\ - x)H N (-y - A) 
x e i( x +y)( tj -t k )-ixt-iys dxdydXj 

and by integrating with respect to y we get 
M N-l / j. \ / \ / 

J,fe=lt,S,p=0 v / \ / \ 

X <h(u k ,t- s)f(Uj,U k ,tj -t k - s+p) 

f'TT /*7T 

x / / [(j)i(uj,X) - </>i(uj,x)] 

J — IT J — 7T 

x A(uj, x)A(u k , —x)Hn(\ — x) 
x e ix ^- tk ~ t)+iXp dxd\ 

= E E"(^) A (^) ft («) ?2( ""^' ) 

j,k=lt,s,p=0 \ / \ / \ / 

X f{uj,U k ,tj -t k - s+p) 

x e N (uj,u k ,p,tj -t k -t), 

where f(u, v, k) and £yv(r) are given by f(u, v, k) = A(u, X)A(v, —X)e tXk dX, 
and 



e N (uj,u k ,p,r) 

N - 1 / m \^ r 

£^W%-,p-m) / A(u j ,x)A(u k ,-x)e i ^ r+m Ux 



2 ^( — ) / 0i(w i ,x)yl(u i ,x)A(u fe ,x)e" , ( r+p )dx 
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= f f Muj,\)A( Uj ,x)A(u k ,-x)e i{pX+rx) h(^\e im{x - x U\dx 

J-TvJ—w m=Q V / 

-2vr J T h(^jMuj,x)A( Uj ,x)A(u k ,-x)e iir+p)x dx. 

But h(§ ) = h(fi) + h'(Z p>m )!2p for some £ p , m G [0, 1]. Thus, 
e N (uj,u k ,p,r) 

= h (jf){ f ^ Mu„X)A( Uj ,x)A(u k ,-x)e^ x+r ^ 



N-l 

X 

m=0 



^ e im{x-\) dxdx 
4>i (uj , x)A(uj , x)A(u k , -x)e^ r+p)x dx 
+ z2 h '(£p,™) N / / 0i(%,A)A(u i ,j;)A(u fc ,-: 

m=0 J-ttJ-tt 



N-l 



h e( N ( u i' r ) + e iV (uj,u k ,p, r), 



where the term (uj,u k ,p,r) is given by 



N v J' 

JV-1 



^ m=0 

with gr(o;) = (/>i(uj, A)A(^,u; + A)A(u fe , -co - A)e i ( p+r ) A dA. Observe that 

by Lemma 1, for every uj,u k ,p,r, eff (uj,u k ,p,r) — > as N — > oo, conse- 
quently we can write 

4\ Uj ,u k ,p,r)= [" g(u>) f; e^^dw 

J —* m=N 

oo 

= ^2 M U jiP- m )f( U ji U ki r + 



m) 

m=N 
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On the other hand, by assumption Al, \f(uj,u k ,r+m)\ < K\r-\-m\ d ( Uj ' +d ( Uh ^ 1 . 

(2) 

Thus, the term e N (uj,u k ,p,r) is bounded by 
\£$(uj,u k ,p,r)\ 



K N ~ 1 

\m-p\cp 1 (u j ,m-p)f(u j ,u k ,r + m) 



< — \m - p\~ 2d w>\r + m^WJ+^k)- 1 

d(uj)+d(u k ) 



m=0 
N-1 



< 



m=0 

{N-1 
£ 
m=0 

1 



m p 
N~N 



r m 
N~N 



■i 



jyd(u k )-d(uj)- 



< K 







p 

x 

m 



-2d{uj) 



r 

N +X 



d(uj)+d(u k ) — l 



dxN d (. u k)-d(uj)- 



< K N d ( Uk *)~ d ( u i)~ 1 ( — 



d(uj)+d(u k ) — l f -i 





p 


f 


x 


h 


m 



-2d(uj) 



dx 



< K N~ 2d<yU ^ r d ("j)+ d ( u fc) _1 
where for notational simplicity we have dropped 9 from dg(-). Thus, 



E N (uj,u k ,p,r) = h(jzj ^2 (f>i{u j ,p-m)f(u j ,u k ,r + 

^ ' m=N 



m) 



+ 0(N~ 2d ^ u ^r d ^ +d ^ Uk ^~ 1 ). 



Hence, A^J can be written as 

M AT-1 



j,fc=lt,s,p=0 V / V / \ / 

X V 2 (^) X] $i{uj,p-m)f(u j ,u k ,t j -t k + m) 

{ ^ ' m=N+l 

+ 0(Ar 2d K)|i,- - t fc - t|««(«i)+«*(«k)-i) J 

._ A (1.2) 

say Therefore, |A^ } | < | aJ' 1} | + 1 a£' 2) | . Observe that since <fo(u,\) ~ C|A| M ( u) 
as A — >■ and d(u) > for all it G [0, 1], we conclude that 4>2(u, 0) = Ylk=-oc ( t >2 ( u i 
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k) = 0. Thus, 



JV-l 

E" 

t,s=0 
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N hl-)Mu,t-s) 



N—l JV-l 



t 1 "(^>(I4)«»^ 

t=0 k=l-N v 7 v 7 



where for simplicity we assume that h(x) = for x outside [0, 1] . Now, by an 



application of Taylor's theorem we can write h(4r + jt) = h(4r) + h'(^(t, k))4r 



for some k) G (jj - y, jj + Hence, 
JV-l JV-l 

at; Viv 

2 JV-l 



t=0 k=l-N 
JV-l 



E E M 

L-JV 
JV-l 

EMiv E 

t=0 V 7 fc=l-JV 

+ E E h(-\h'(t{t,k))Uu,k)±. 

t=0 k=l-N ^ 7 

Note that E£i-jv <?2 («, fc) = 2 Efcljv <?2 («, fc) • Therefore, | YJk=i-N^ ( n > fc ) I ^ 
KY,%: N k-™to-i<KN-*M. Consequently, K±) 2 Ek=i- N M", 

k)\ < KN l ~ 2d ( u \ On the other hand, | Y^" 1 Ek=l-N KjrWitfo k ))Mu, k) x 
#1 < K Y,k=\ k~ 2d{u) < KN 1 ~ 2d ( u \ Hence, 



JV-l JV-l 

E E * 

t=0 k=l-N 

Thus, we conclude that 

M JV-l 



t k 
N J'"\N + N 



(f) 2 (u,k) 



j,k=l,j^k p=0 



x \p-m\- 2d( - u ^- 1 \S(j-k)+m\ d ^ +d ^- 1 

m=N+l 



M JV-l 

E E 



N ' N 



d(v.j)+d(uk)—l 
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N 



E 

m=N+l 



p m 
N~N 



-2d(uj)-l 



S . . , . m 



d(iij)+d(u k )-l , 

iV 



Af 

^ E 

j,k=l,j^k 



'0 Jl 



— (j-k) + x 



(uj)+d(u k )-l 



\x-y\ 



-2d(uj)-l 



(j -k)+y 



d(uj)+d(u k ) — l 



dydx. 



Let 5 > and define Ji(<5) = {j, k = 1, M: k < j V k - j > f (1 + <5)} and 
-^2(<5) = 0) = 1, M : < k — j < ^(1 + 5)}. Therefore, the sum above can 



be written as £f fc=lj y fc - = £/ l(<5) • + E/ 2 (5) 1 : = I^n^'I + I&n^I, say. 
Observe that over h(S) we have that — k) + x\~ a < K\jj(j — k)\~ a for 
a > 0. Hence, 

|A 



(l.i.i). 



.(1.1-2)1 



.(l-l-l) I 
V 7V 



h(8) 



d(«j)+d(u fc )-l 



1 />oo 



X 




Jl 



S 



-2d(uj)-l 



d(uj)+d(u k )-l 



dy dx 



M 

^ E 



d(u,-)+d(ujfe)— 1 



1 />oo 




Jl 



|x-y| 



S 



-2d(uj)-l 



d(v,j)+d(u k ) — l 



dydx. 



Since the integrands in the above expression are all positive, an application 
of Tonelli's theorem yields 

M „ d ( Uj )+d(u k )-l 



ia£ l1) i<* E 



oo rl 



\x-y\ 



-2d(uj)-l 



1 JO 
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X 



s 



(j — k)+y 



M 

± K E 



N 

d(uj)+d(u k ) — 1 



d(uj)+d(u fc )-l 



dxdy 



X / [(y- -y- 2rf K)] 



5 



d(uj)+d(u k )-l 



dy. 



Then, by Lemma 3 we conclude that 



(i-i-i)i 

V 7V 



M 

^ E 

M 

E 



< K 



U-k) 



2d{uj)+2d{u k )-2 



U-k) 



2d{u )+2d(u k )~2 



d(uj)+d(u k )<l/2 
M 

+ E 

d( Uj )+d(u k )>l/2 



2d(M i )+ M ( u fc)- 2 



For the first summand above, we have the upper bound 

M / ,r\ 2 / AT\ 2 



£ i j '- fe i _1 (f) -^(f) MlogM ' 



while the second summand can be bounded as follows: 

M / ■ t \ 2d( Uj )+2d{u k )-2 



E 

j,k=X,j^k 
diuj)+d(u k )>l/2 



SM i 


' 3 


--) 


N \ 


K M 


M J 



< K 



<K 



- M 

E 

j,k=i 



2 k_ 

M M 



£-1 



-£ fl fl 




\x — y\ e 1 dxdy 



o J o 



( Jr) M2 ^ KM2 - 
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Thus, 
(23) 



IA 



(l.i.i), 

N 



I — ] 



On the other hand, if z = jj(k — j) then < z < 1 + S for j, k G l2($)- Thus, 
an application of Lemma 9 yields for 2 > 5 > 



,(1-1.2) | 



2d-l 



where d := info< u <i d(u) > 0. Hence, by defining p = k — j and P = N/S we 
can write 

P(l+5) 



IA 



(1.1.2), 

N 



< KM 

P =i 



1 



P_ 

P 



2d-l 



N f 1+s N 
< KM— / II -x\ 2d ~ l dx <KM- 



S Jo S 

Note that from assumption A3, N/S —> oo. Thus, by combining the above 
bound and (23) we conclude that 

2 



(24) 



( ^ ) 



A similar bound can be found for |A^' 2 ^| and consequently for |A^|. 
Furthermore, an analogous argument yields a similar bound for the term 

(2) 

\A N I appearing in (21). Now, we focus on obtaining an upper bound for the 
remaining term Rn from (22). By integrating that expression with respect 
to A we get 

M N-l 



\Rn\ < 



N 



i,k=it,s=o v / \ / 



x / 4> 2 (uk,n)A(u j ,x)A(uk,-x) 
Jn 

x A( Uj ,y)y- d ^L N (y + y)H{x - n) 
x e i x ( tj -t k+ t)+iy(t 3 -t k + s ) dxdydfI 



where the function Lat(-) is defined as 

N, \x\<l/N, 



L N (x) 



l/\x\, 1/N<\x\<ir. 
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Hence, 

l^vl < y 



M N-l 
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j,k=l t,s,p=0 



N \N \N 



x 4>i(uj,s -t)f(uj,u k ,tj -t k + t-p) 
Muk,v)A( Uj ,y)y- d ^ 

x L N ( f i + y)e iy ^- tk+ ^ +ip ^dyd^ 



M N-l 



<K~Y^ Yl l?i( u j' s_ *)ll/(%' u fe'*i ~tk + t-p)\ 



j,k=lt,s,p=0 




Muk,v)A(u v y)y- d ^ 

r 

x L N (fi + y)e iy ^- tk+s)+ip ' M dydfi 



.N 



M N-l 



j,fc=l t,s,p=0 



r r L N ( f i + y)y- d ^- d ^dyd f i 

J —77 J — 7T 



< K 



AT\ AT M N - 1 



j,k=l t,s,p=Q,t^s 



\S(j-k)+t-p\ d ^ 



)+d(« fc )-l 



.A^ log A 



Af JV-1 



< K tl ^ ^ | a _ t |-2d(uj)-l5«J(«j)-W(«*)-l 



j,k=l t,s,p=0,tj^s 



M 



x |j — / t| d ( M j) +d ( Ufe ) _1 



I" 1 



J k_ 

M M 



d(uj)+d(u k )-l 



Since by assumption A3, T/N 2 — > 0, we conclude that 

at3 yrl [-it i-tt 

(25) IR^KK^^ / Ix-y^^^AA 3 ^/ 2 ^ 2 . 
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Thus, from (24) and (25), we conclude 



[2ttMH 2<n (0)} 2 
T 



[27rMi? 2iJV (0)] 2 

M r 

V] / 4>i(uj,x)A(uj,x)A(u k ,-x)(j)2(u k ,y) 



M 

x 

j,k=i 



x A(uj,y)A(u k , -y)H N (X - x)H N (x - fi)H N (fi + y) 
xH N (-y- X)e i(x+y)(t ^ dA d/x + CV, 

where 

(26) C N = o( l ^ + ^- + NT^ 



S N 2 

Therefore, by assumption A3 we conclude that Cn = o(l). By following suc- 
cessive decompositions as in (21), we replace 4> 2 (uk,y) by (p 2 (uk,x), A(u k , —y) 
by A(uk,—x) and A(uj,y) by A(uj,x), respectively. Thus, 

TBjp T 



[2ttMH 2 ,n{0)} 2 [2ttMH 2 , n (0)} 2 

M 

X 

J 



M r 

V" / 4>i(uj,x)A(u j ,x)A(u k ,-x)4> 2 (uk,x)A(u j ,x) 



x A(u k , -x)H N (X - x)H N (x - n)H N (n + y) 
x H N (-y - X)e i{x+y) ^~ tk) dxdydXdfi + o(l). 
By integrating with respect to /i and A, we get 



TB 0) 

1 n N 



[2nMH 2iN (0)\ 2 

M 



T 



[MH 2>N (Q)] 2 ^ 



/IT /*7T 
/ 4>i(uj,x)A(uj,x) 
-7T «/ — 7T 



[M/r 2iJV (o)] 



x A(it fe , -x)(f) 2 (u k , x)A(uj,x)A(u k , -x) 
x |fT 2 ,jv(z + yjpe^^fe-**) dxdy + o(l) 

! XI / / [ < ? ;> i( u i' x )/( u i' a; )][ < ? !> 2(wfc,a;)/(n fc ,x)] 
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x \H 2N {x + y )\ 2 e iix+y)[s{j - k)] dxdy + o(l) 

= n77t — 7nvi2 Yl / / [Muj,x)f(uj,x)][<j)2(u k ,x)f(u k ,x)} 

x \H 2N (z)\ 2 e iz ^ j - k ^dxdz + o(l) 



[MH 2)N (U)f 

M n 

x ^2 / [^1 ( u j, x )f( u j> x )] [<h(u k ,x)f(u k ,x)] 

N-l 

X 



E ^ (£K (£) f] e-W- k ^ dx dz + o(l) 



,t=o 



S(j-k)=s-t 

[<f>l(Uj,x)f(Uj,x)] 

x [0 2 (n fc ,x)/(u fc ,x)] + o(l). 



By assumption A3, for S < N we can write 



J J B 



[2^Mi7 2i7V (0)] 2 * 
2vrT 



[Mtf 2iJV (0)] 2 
N-l(N-t)/S 

\N N 

t=o p=-t/S 

M ~\P\ , ff 



E E "MwXV 



X 



/7T 

x [0i(u j+p , x)f(u j+p , x)] dx + o(l). 



Observe that by the assumptions of this proposition the products 4>i(u, x)f(u, x) 
and 4>2(u,x)f(u,x) are differentiable with respect to u. Furthermore, note 
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that by assumption A3, limj' i 5_ i . 00 = for any \p\ < ^. Consequently, 
gM-\p\ 

T E / [^ 1 ( n i' x )/( n i' x )][ < A2K+ p ,a;)/(u i+ p,a;)]dx 
3=1 J ~ n 

1 pTV 

/ cf)i(u,x)(f)2(u,x)f(u,x)' 2 dxdu, 

J-n 

for any |p| < ^ as M, AT, S, T — > oo. On the other hand, 

E E >n v F Uf + 



S 2 [Mff 2jiV (0)] 2 ^ p ^ /s \NJ \N'NjN 2 

-^2vr y h 2 (x)h 2 (x + y)dxdy^ h 2 (x)dx^j = 2vr, 



as M, AT, S*, T — > oo. Therefore, in this case 



T 



[27rMtf 2i7V (0)p 



1 /-7T 



2-Bjy — )-27r / / 4>i(u,x)(j)2{u,x)f(u,x) dx du, 







as M, iV, S, T — )• oo. Similarly, we have that 



[27rMH 2 , N (0)} 



1 /-?r 



as M, A 7 ", 5, T — >■ oo. Therefore, by virtue of (20) this proposition is proved. 

□ 

Proposition 3. Let cum p (-) be the pth order cumulant with p > 3. 
Then, T p / 2 cum p ( Jt{4>)) -^0, as T — >■ oo. 

Proof. Observe that Jt(0) can be written as 

2,MH^) Y ' QWY ' 

where the block-diagonal matrix Q(cf)) is defined in (16) and Y £ ~R NM is 
a Gaussian random vector defined by Y = (Y(ui)', . . . , Y(um)')', Y(u) = 
(Yi(u),. . .,Y N (u)), Y t {u) = h(jj)Y [uT }_ N / 2 +t+i,T with Y[ uT] _ N/2+t+ i t T satis- 
fying (1). For simplicity, denote the matrix Q{4>) as Q. Since Y is Gaussian, 

9P- l (r> — 1 V 
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where R = Var(Y). Let \A\ = [ti(AA')} 1 / 2 be the Euclidean norm of matrix 
A and let \\A\\ = sup^ x ^ = i(Ax)' Ax be the spectral norm of A. Now, since 
|tr(QB)| < \Q\\B\ and \QB\ < \\Q\\\B\ we get \tv{RQ) p \ < \\RQ\\ P ~ 2 \RQ\ 2 . 

On the other hand, for fixed A, decompose the function </>(•, A) as <p(-, A) = 
</>+(•, A) — (/>_(-, A) where A), A) > 0. Thus, we can write Q = Q((ft) = 
Q(4>+ — </>-) = Q(4>+) — Q(<f>-) '■= Q+ — Q-i say. Now, by Lemma 6 we con- 
clude that 

\\RQ\\ = \\RQ+ - RQ-\\ < \\RQ+\\ + \\RQ-\\ < K{MN 1 ~ 2d T 2d ~ l ), 
and by Proposition 2 we have that |-RQ| 2 < K m2 J? . Thus, 



\tv(RQ) p \ < K{MN l - 2d T 2d ~ l ) 



T 

M 2 N 2 



T 

Consequently, 

/ N\ 0-~ 2d )(P- 2 ) /./T 
\T*l 2 ^ p [J T m\<KM^I 2 (P} 



P-2 



Since p > 3 and by assumption A2, N/T —?■ and y/T/N — > as T, N — > oo, 
the required result is obtained. □ 



3.2. Proof of theorems. 



Proof of Theorem 2.1. To prove the consistency of the Whittle es- 
timator, it suffices to show that 

sup|£ T (0)- ->0, 
e 

in probability, as T oo, where £((9) := ^ /J JZ n [log /g (u, A) + ] dX du. 

Define ge(u,X) = fo{u,X)~ l . By assumption Al, ge(u, A) is continuous in 0, 
A and u. Thus, can be approximated by the Cesaro sum of its Fourier 
series 

t=-Lm=-L v 7 v 7 

x m) exp(—i2iruj£ — iXm), 

such that supg \ge(u, A) — g^(ii,A)| < e; see, for example, Theorem 1.5(h) 
of Korner (1988). Following Theorem 3.2 of Dahlhaus (1997), we can write 



sup |£ T (#) -£(#)| 
e 
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~ + 'IirJl^ F [lN{uj,X) + /(«,-, A)] dX 

j Af /»7T 

— / exp(— i27TUj£ — iXm) 
j= i 

x {/jv(it,-,A) - f(uj,X)}dX 
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1 /*7T 



where 



gg{l,m)= I I gg(u, X) exp(i2iru£ + iXm) dudX. 

JO J~TT 

Consequently, \go{£,m)\ ^ ^ SVL P(e, u ,x)\9o( u ^)\- However, by assumption 
Al, \gg(u, A) | is continuous in 0, u and A. Thus, since the parameter space 
is compact we have that \gg(£,Tn)\ < K, for some positive constant K. Now, 
by defining for fixed £, m = 1, . . . , L, tfi(u, A) = cos(27ra^) cos(Am) or cf>(u, X) = 
sin(27ru£) cos(Am) in Proposition 1 and 0i(it, A) = <f>2(u, A) = cos(2ttu£) cos(A x 
m) or 0i (u, A) = <fo(w, A) = sin(27ra£) cos(Am) in Proposition 2, we deduce 
that 

1 L L ( 



£=-Lm=-L 



1 r 

ex P H2vru 

.7=1 17 _7r 



— i Am) 



x [Jjv(uj, A) - /(uj,A)]dA 



E (i-?)('-?)7»«»)i 

t>—_T m ——T V / \ /a 

— / cos(27T?/j^) cos(Am) 

7 = 1 ^ _7r 



< 



(27) 



£=-Lm=-L 



x [JjvO^A) -/(uj,A)]dA 
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M 



+ 



and 



1 . " r n 
— / sm(2-KUj£) cos(Am) 

x [I N (uj,X) - f(uj,X)]d\ 



M r7T r i rn 

) dX du, 



( 28 ) j-Szf{lN(u j ,X) + f(u 3 A)}dX^2f r f(u,X), 

in probability, as M — )■ oo. Now, from the limits (27) and (28), this theorem 
follows. □ 

Proof of Theorem 2.2. Let 9t be the parameter value that minimizes 
the Whittle log-likelihood function Ct{9) given by (7) and let 9q he the true 
value of the parameter. By the mean value theorem, there exists a vector 9t 
satisfying \\9~t — 9q\\ < \\9t — 0o\\, such that 

(29) VCt0t) - V£r(#o) = [V 2 C T (9 T )](9 T - 9 ). 

Therefore, it suffices to show that (a) V 2 £t(#o) — > r(0o) 5 as T — )• oo; (b) 
V 2 £ t (0t) ~ V 2 C T (0 ) -> in probability, as T — » oo; and (c) VTVC t {0 ) -> 
A^[0,r(^o)], in distribution, as T — > oo. To this end, observe that 

v 2 c T (6) = r A) - Mu^x^Mu^xy 1 

-Vfeiu^X^Vfeiuj^xy^'dX 
1 1 f Z" 71 

+ E / Vlog/ e (^,A)[Vlog/ 9 (n i ,A)] , dAl 

= i-[J T (0)-J(0)]+r((?) + o(l: 

where cj)(u,X) = X7 2 fg(u, X)" 1 . Hence, an application of Proposition 1 and 
Proposition 2 yields parts (a) and (b). On the other hand, part (c) can 
be proved by means of the cumulant method. That is, by showing that all 
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the cumulants of VTV Ct{9q) converge to zero, excepting the second order 
cumulant. To this end, note that 

V£ T (#o) = r [^(%, A) - f eo (uj, X)]Vf 0o ( Uj , X)' 1 dX 

^M4>) /eo(^,A)V/ eo (n i , A)" 1 ^ 



M 

(30) 



where <fi(u,X) =Vfe (u,X) 1 . Hence, by Proposition 1 and assumption A3, 
the first-order cumulant of VTVCt(9q) satisfies 



(31) 



as T — > oo. Furthermore, by (30) we have that the second-order cumulant of 
(9q) can be written as 

Tcov[VC T (e ),VC T (9o)} = — ^Tcav[J T ($, Jt^)]- 
Therefore, by Proposition 2 we have that 
lim Tcov[V£ T (6>o),V£ T (fl )] 

T— >oo 



1 

4tt 
1 

4tt 



1 [ W Vfe^Xy^Vfe^Xy^'fe^XfdXdu 

J-TT 

1 [ W V dog fe (u,X)[V log f eo (u,X)]'dXdu = T(9o). 

J-TT 



Finally, for p > 2, Proposition 3 gives T p / 2 cum p [V £t(9o)] — > 0, as T — > oo, 
proving part (c). □ 

Proof of Theorem 2.3. By observing that the Fisher information 
matrix evaluated at the true parameter, Tt(9q), is given by 

r T (9 )=Tco V [VC T (9 ),VC T (9 )}, 

the result is an immediate consequence of Proposition 2. □ 
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r(Th 



Proof of Theorem 2.4. Let = \V>- ) )i,j=i,..., p = Var(/3), then 

/ Var[d(u)]du = / £ J>(u)^f ] 9j {u) du 
i=i j=i 

„■ 1 „• i J 



i=l j=l 
V V 



i=i j=i 

where 6ij = Jq gi(u)gj(u) du = bji. Therefore, by Theorem 2.2 

iToo T f Var [d{u)] du= iziz ^J Tv ^ )]bij = e e ^ 

°° J ° i=l j=l °° i=l j=l 

where ^4 = (ajj)jj=i,...,p = T _1 and 

r^-^ y^ — log/(n,A) — log f{u,X)dXdu. 

But, log/(u,A) =log(a 2 ) -log(2vr) - dp(u)\og\\ - e iX \ 2 . Thus, 

^-log/(n,A) = -^(n)log|l-e u | 2 . 

Hence, = gi(u)gj{u) dux ± /^(log |1 - e iX \ 2 ) 2 d\ = . Therefore, 
r = ^g-S and A = -^B~ . Consequently, since A and B are symmetric ma- 
trices limy^oo T £ Var[d(u)] du = tr(AB) = ^ tr(J p ) = fg. □ 

4. Simulations. In order to gain some insight into the finite sample per- 
formance of the Whittle estimator discussed in Section 2, we report next a 
number of Monte Carlo experiments for the LSARFIMA model 

Y t:T = a{t/T){\ - 0B)(1 - BY d ^e u 

for t = 1, . . . , T with d(u) = ao + a±u, a(u) = /?o + Piu and Gaussian white 
noise {et} with unit variance. The samples of this LSARFIMA process are 
generated by means of the innovation algorithm; see, for example, Brockwell 
and Davis (1991), page 172. In this implementation, the covariances of the 
process {1*,t} is given by 

E[Y s , T Y t>T ] = a T[1 ~ ^ /T) ~ d{t/T)]T[s ~ ' + d{s/T)] 



TJ \TjT[l- d(s/T)]T[d(s/T)]T[s -t + 1- d(t/T)] 
! ! ja ,0 s-t-d{t/T) tf s-t + d(s/T) 



s-t-l + d(s/T) s-t + l-d(t/T) 
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Fig. 1. Contour curves of the empirical MSE of Whittle estimator. Left: sample size 
T = 512. Right: sample size T= 1024. 

for s,t = 1, . . . ,T, s>t. Let 6 = (ag, c^i, A)j Pi-i'&Y be the parameter vector. 
The Whittle estimates in these Monte Carlo simulations have been computed 
by using the cosine bell data taper (9). Figure 1 displays the contour curves 
for the empirical mean squared error (MSE) for the Whittle estimator 9 
defined in this case as the average of \\9 — 9\\ 2 over 100 replications of 9, where 
9 is the true value of the parameter. These contour curves correspond to 9 = 
(0.20, 0.25, 0.5, 0.3, 0.5), for sample sizes T = 512 and T = 1024, respectively. 
In these graphs, the darkest regions represent the minimal empirical MSE 
while clear regions indicate greater MSE values. Note that for the case T = 
512, shown in the left panel, the minimal empirical MSE region is located 
around N ~ 105 and S ~ 35. For the sample size T = 1024, displayed on the 
right panel, the minimal empirical MSE is reached close to N ~ 200 y S « 45. 
As noted in these graphs, there is a degree of flexibility for selecting TV and 
S as long they belong to the areas with minimal empirical MSE. Contour 
curves for other parameters 9 such as those presented in Tables 1 and 2 are 
similar to Figure 1 and produce similar empirical optimal regions for iV and 
S. Tables 1 and 2 report the results from the Monte Carlo simulations for 
several parameter values, based on 1000 replications. These tables show the 
average of the estimates as well as their theoretical and empirical standard 
deviations (SD). The theoretical SD are based on Theorem 2.2 with matrix 
Tg given by 

/ r a j a $\ 
r e = o o , 

where 7^ = [ log ^~^ , 2fl~^ ] / ' ^ = ' anc ^ ma t r ices r Q and Tp are 
given in Example 2.3. The bandwidth parameters N and S for each table are 
based on values found in Figure 1 for 9 = (0.20,0.25,0.5,0.3,0.5). As men- 
tioned above, these values are very similar for the other parameters reported 
in Tables 1 and 2. Observe from these tables that the estimated parameters 
are close to their true values. Besides, the empirical standard deviations 
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Table 1 

Whittle estimation: sample size T = 512, block size N = 105 and shift S = 35 



Parameters Estimates 



Case 


OLO 


cci 


/3o 


0i 


■d 


ci 




/3o 


Pi 


■d 


1 


0.15 


0.20 


0.5 


0.3 


0.5 


0.130 


0.177 


0.497 


0.299 


0.473 


2 


0.15 


0.20 


0.8 


-0.2 


0.5 


0.124 


0.167 


0.795 


-0.201 


0.463 


3 


0.20 


0.25 


0.5 


0.3 


0.5 


0.161 


0.219 


0.497 


0.301 


0.455 


4 


0.20 


0.25 


0.8 


-0.2 


0.5 


0.163 


0.218 


0.797 


-0.201 


0.453 


5 


0.30 


-0.20 


0.5 


0.3 


0.5 


0.291 


-0.183 


0.498 


0.299 


0.506 


6 


0.30 


-0.20 


0.8 


-0.2 


0.5 


0.287 


-0.183 


0.797 


-0.203 


0.501 


7 


0.15 


0.20 


0.5 


0.3 


-0.4 


0.138 


0.189 


0.496 


0.301 


-0.407 


8 


0.15 


0.20 


0.8 


-0.2 


-0.4 


0.138 


0.190 


0.799 


-0.206 


-0.410 


9 


0.20 


0.25 


0.5 


0.3 


-0.4 


0.195 


0.228 


0.498 


0.299 


-0.409 


10 


0.20 


0.25 


0.8 


-0.2 


-0.4 


0.193 


0.229 


0.795 


-0.197 


-0.412 


11 


0.30 


-0.20 


0.5 


0.3 


-0.4 


0.286 


-0.197 


0.498 


0.298 


-0.404 


12 


0.30 


-0.20 


0.8 


-0.2 


-0.4 


0.279 


-0.180 


0.796 


-0.203 


-0.404 



Theoretical SD Estimated SD 



Case 


o-(So) 


ff(Si) 


tr0o) 


<r($i) 




<t(S ) 


er(Si) 


90o) 


HPi) 


a 


(5) 


1 


0.115 


0.119 


0.035 


0.069 


0.109 


0.117 


0.146 


0.045 


0.089 





.106 


2 


0.115 


0.119 


0.047 


0.075 


0.109 


0.115 


0.146 


0.057 


0.100 





.103 


3 


0.115 


0.119 


0.035 


0.069 


0.109 


0.107 


0.132 


0.043 


0.091 





.096 


4 


0.115 


0.119 


0.047 


0.075 


0.109 


0.110 


0.131 


0.056 


0.098 





.102 


5 


0.115 


0.119 


0.035 


0.069 


0.109 


0.131 


0.140 


0.043 


0.090 





.108 


6 


0.115 


0.119 


0.047 


0.075 


0.109 


0.125 


0.140 


0.057 


0.099 





.107 


7 


0.074 


0.119 


0.035 


0.069 


0.051 


0.089 


0.155 


0.045 


0.091 





.058 


8 


0.074 


0.119 


0.047 


0.075 


0.051 


0.088 


0.150 


0.054 


0.096 





.053 


9 


0.074 


0.119 


0.035 


0.069 


0.051 


0.090 


0.142 


0.044 


0.091 





.053 


10 


0.074 


0.119 


0.047 


0.075 


0.051 


0.088 


0.142 


0.057 


0.099 





.054 


11 


0.074 


0.119 


0.035 


0.069 


0.051 


0.089 


0.140 


0.046 


0.093 





.055 


12 


0.074 


0.119 


0.047 


0.075 


0.051 


0.093 


0.146 


0.057 


0.099 





,056 



are close to their theoretical counterparts. These simulations suggest that 
the finite sample performance of the proposed estimators seems to be very 
good in terms of bias and standard deviations. This, despite the fact that in 
many of these simulations we have tested the method with large values of 
the long-memory parameter, that is, close to ^. In Table 1, for example, for 
the combination ao = 0.20, a± = 0.25, the maximum value of d(u) is 0.45. 
Additional Monte Carlo experiments with other model specifications are re- 
ported in Palma and Olea (2010). Those simulations explore the empirical 
optimal selection of N and S and the finite sample performance of the Whit- 
tle estimators. Note, however, that further research is needed to establish 
optimal selection of N and S from a theoretical perspective. A comparison of 
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Table 2 

Whittle estimation: sample size T = 1024, block size N — 200 and shift S = 45 



Parameters Estimates 



Case 


OLO 


cci 


/3o 


0i 


■d 


ci 




/3 


Pi 


■d 


1 


0.15 


0.20 


0.5 


0.3 


0.5 


0.127 


0.193 


0.498 


0.301 


0.475 


2 


0.15 


0.20 


0.8 


-0.2 


0.5 


0.131 


0.195 


0.796 


-0.198 


0.479 


3 


0.20 


0.25 


0.5 


0.3 


0.5 


0.179 


0.239 


0.497 


0.304 


0.473 


4 


0.20 


0.25 


0.8 


-0.2 


0.5 


0.176 


0.241 


0.798 


-0.199 


0.475 


5 


0.30 


-0.20 


0.5 


0.3 


0.5 


0.286 


-0.189 


0.500 


0.298 


0.493 


6 


0.30 


-0.20 


0.8 


-0.2 


0.5 


0.286 


-0.197 


0.799 


-0.202 


0.488 


7 


0.15 


0.20 


0.5 


0.3 


-0.4 


0.143 


0.198 


0.498 


0.302 


-0.404 


8 


0.15 


0.20 


0.8 


-0.2 


-0.4 


0.144 


0.197 


0.797 


-0.197 


-0.404 


9 


0.20 


0.25 


0.5 


0.3 


-0.4 


0.195 


0.245 


0.500 


0.300 


-0.405 


10 


0.20 


0.25 


0.8 


-0.2 


-0.4 


0.197 


0.243 


0.797 


-0.199 


-0.405 


11 


0.30 


-0.20 


0.5 


0.3 


-0.4 


0.293 


-0.197 


0.500 


0.299 


-0.402 


12 


0.30 


-0.20 


0.8 


-0.2 


-0.4 


0.293 


-0.200 


0.797 


-0.199 


-0.403 



Theoretical SD Estimated SD 



Case 


fr(S ) 


ff(Si) 


<T0o) 


<r0i) 


°0) 


<t(S ) 


er(Si) 


$0o) 


$0i) 


90) 


1 


0.081 


0.084 


0.025 


0.049 


0.077 


0.089 


0.106 


0.032 


0.064 


0.081 


2 


0.081 


0.084 


0.033 


0.053 


0.077 


0.097 


0.106 


0.040 


0.069 


0.085 


3 


0.081 


0.084 


0.025 


0.049 


0.077 


0.093 


0.097 


0.031 


0.062 


0.078 


4 


0.081 


0.084 


0.033 


0.053 


0.077 


0.090 


0.095 


0.038 


0.067 


0.073 


5 


0.081 


0.084 


0.025 


0.049 


0.077 


0.107 


0.103 


0.030 


0.061 


0.091 


6 


0.081 


0.084 


0.033 


0.053 


0.077 


0.101 


0.104 


0.040 


0.068 


0.079 


7 


0.052 


0.084 


0.025 


0.049 


0.036 


0.066 


0.110 


0.031 


0.061 


0.039 


8 


0.052 


0.084 


0.033 


0.053 


0.036 


0.065 


0.113 


0.039 


0.066 


0.040 


9 


0.052 


0.084 


0.025 


0.049 


0.036 


0.066 


0.100 


0.030 


0.060 


0.040 


10 


0.052 


0.084 


0.033 


0.053 


0.036 


0.058 


0.087 


0.040 


0.068 


0.037 


11 


0.052 


0.084 


0.025 


0.049 


0.036 


0.066 


0.103 


0.029 


0.060 


0.039 


12 


0.052 


0.084 


0.033 


0.053 


0.036 


0.064 


0.101 


0.039 


0.068 


0.039 



the performances of the Whittle method with a kernel maximum likelihood 
estimation approach proposed by Beran (2009) and two data illustrations 
are also discussed in that paper. 

5. Final remarks. A class of locally stationary long-memory processes 
has been addressed in this paper, which is capable of modeling nonsta- 
tionary time series data exhibiting time-varying long-range dependence. A 
computationally efficient Whittle estimation method has been proposed and 
it has been shown that these estimators possess very desirable asymptotic 
properties such as consistency, normality and efficiency. Moreover, several 
Monte Carlo simulations indicate that the estimates perform well even for 
relatively small sample sizes. 
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APPENDIX 

This appendix contains nine auxiliary lemmas used to prove the theorems 
stated in Section 2 and the propositions stated in Section 3. Proof of these 
results are provided in Palma and Olea (2010). 

Lemma 1. Let f(u,X) be a time-varying spectral density satisfying as- 
sumption Al and let (ft: [0,1] x [— tt,tt] — > M. be a function such that cft(u,X) 
is continuously differentiable in X. Consider the function defined by 

(j)(u, X + uj)f(u,u) doj, 

-TT 

and its Fourier coefficients g(u,k) = g(u, X)e~ lkX dX. Under assumption 
Al, for every uG [0,1] we have that lim^^ro X)fc=-n s( u > ^) = 2irg(u,0). 

Lemma 2. Consider the function (ft : [0, 1] x [— tt, it] — > C, such that d(ft(u, 
^)/du exists and \d(ft(u,j)/du\ < K\^\~ 2d ( u \ where < d(u) < d for all u G 
[0,1]. Then, for any < t < N we have that 



H 



N 



|,7 



Lemma 3 . Consider d\,di G [0,1/2) and for any t G Z define the integral 
1(1) = f™[(x - l)- 2dl - x" Ml ]|£ + x \dx+d2-i dx _ Then j(^) = o(\l\ M2 - 1 ). 

Lemma 4. Let (ft(u, X) be a positive function, symmetric in X, such that 
(ft(u,X)>C\X\ 2d ^ u \ forXe [-it, it], where d(u) is a positive bounded function 
for u G [0, 1] and C > 0. Let Q(u) for u G [0, 1] be the matrix defined in (16). 
Then there exists K>0 such that X'Q^u^X < KX'XN 2d ^ , for all vector 

xe 



Lemma 5. Let (ft(u,X) be a positive function, symmetric in X, such that 
(ft(u,X) > C\X\ 2d ^ u \ for X G [— it, it], where d(u) is a positive bounded function 
for u G [0, 1] and C > 0. Let Q(u) for u G [0, 1] and Q((ft) be the matrices 
defined in (16). Then there exists K > such that 

\X'[Q{(fty l - Q(<p)]X\ < KX'XN 2d+1 l 2 , 

where ip(u,-) = <ft(u,-)~ 1 /4ir 2 , d = supd(u) <oo and X £R NM . 

Lemma 6. Let (ft(u,X) be a positive function, symmetric in X, such that 
(ft(u,X) > C|A| M ("), for X G [— it, it], where d(u) is a positive bounded function 
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for u € [0,1] and C > 0. Let Q(4>) be the block- diagonal matrix defined in 
(16). Then there exists K > such that 



sup 

x 



X'RX 



< KMN 1 ~ 2d T 2d ~ l , 



X'Qi^X 
where d = sup d(u) < \ and X <E R NM . 

Lemma 7. Let /(A) and (fi(\) be two real-valued functions defined over 
A 6 [— 7r,7r] with Fourier coefficients f(k) and <p{k), respectively, satisfying 
\ f(k)cj)(k)\ < K/k 2 , for some positive constant K and \k\ > 0. Let C(N) be 

given by C(N) = Ylu^) 1 h 2 (jj) J2k=N-t /(&)<K&) with bounded data taper, 
\h(u)\ < K, for all u G [0,1]. Then there exits a positive constant K such 
that \C{N)\ <K\og 2 N. 

Lemma 8. Define D(N,T) = i Y^" 1 Ek=Lt+i W^C^) withfunc- 
tion | | < ClogiV for all < k < N , N > 1, where C is a positive con- 
stant. Then there exists a constant K > such that \D(N,T)\ < K log, T N . 

Lemma 9. Let z£ [0,1 + 5] with 2 > 5 > and 2/3 > 2a > 0. Then, the 
positive double integral I(z) = J \z — x| Q_1 J^°{y — x)~@(y — z) a ~ l dydx, 
satisfies I(z) < K\l - z\ 2a ~P . 

Acknowledgments. We are deeply thankful to the Associate Editor and 
two anonymous referees for their careful reading of the manuscript and for 
their constructive comments which led to substantial improvements. 



REFERENCES 

Beran, J. (2009). On parameter estimation for locally stationary long-memory processes. 
J. Statist. Plana. Inference 139 900-915. MR2479836 

Brillinger, D. R. (1981). Time Series: Data Analysis and Theory, 2nd ed. Holden-Day, 
Oakland, CA. MR0595684 

Brockwell, P. J. and Davis, R. A. (1991). Time Series: Theory and Methods, 2nd ed. 
Springer, New York. MR1093459 

Chandler, G. and Polonik, W. (2006). Discrimination of locally stationary time series 
based on the excess mass functional. J. Amer. Statist. Assoc. 101 240-253. MR2268042 

Dahlhaus, R. (1996). On the Kullback-Leibler information divergence of locally station- 
ary processes. Stochastic Process. Appl. 62 139-168. MR1388767 

Dahlhaus, R. (1997). Fitting time series models to nonstationary processes. Ann. Statist. 
25 1-37. MR1429916 

Dahlhaus, R. (2000). A likelihood approximation for locally stationary processes. Ann. 

Statist. 28 1762-1794. MR1835040 
Dahlhaus, R. and Polonik, W. (2006). Nonparametric quasi-maximum likelihood 

estimation for Gaussian locally stationary processes. Ann. Statist. 34 2790-2824. 

MR2329468 



42 



W. PALMA AND R. OLEA 



Dahlhaus, R. and Polonik, W. (2009). Empirical spectral processes for locally station- 
ary time series. Bernoulli 15 1-39. MR2546797 

Doukhan, P., Oppenheim, G. and Taqqu, M. S., eds. (2003). Theory and Applications 
of Long-Range Dependence. Birkhauser, Boston, MA. MR1956041 

Fryzlewicz, P., Sapatinas, T. and Subba Rao, S. (2006). A Haar-Fisz technique for 
locally stationary volatility estimation. Biometrika 93 687-704. MR2261451 

Genton, M. and Perrin, O. (2004). On a time deformation reducing nonstationary 
stochastic processes to local stationarity. J. Appl. Probab. 41 236-249. MR2036285 

Gradshteyn, I. S. and Ryzhik, I. M. (2000). Table of Integrals, Series, and Products, 
6th ed. Academic Press, San Diego, CA. MR1773820 

Granger, C. W. J. and Ding, Z. (1996). Varieties of long memory models. J. Econo- 
metrics 73 61-77. MR1410001 

Guo, W., Dai, M., Ombao, H. C. and von Sachs, R. (2003). Smoothing spline ANOVA 
for time-dependent spectral analysis. J. Amer. Statist. Assoc. 98 643-652. MR2011677 

Jensen, M. J. and Whitcher, B. (2000). Time-varying long memory in volatility: De- 
tection and estimation with wavelets. Technical report, EURANDOM. 

Korner, T. W. (1988). Fourier Analysis. Cambridge Univ. Press, Cambridge. 
MR0924154 

Orbe, S., Ferreira, E. and Rodriguez-Poo, J. (2005). Nonparametric estimation 
of time varying parameters under shape restrictions. J. Econometrics 126 53-77. 
MR2118278 

Palma, W. (2007). Long-Memory Time Series: Theory and Methods. Wiley, Hoboken, 
NJ. MR2297359 

Palma, W. and Olea, R. (2010). Supplement to "An efficient estimator for locally sta- 
tionary Gaussian long-memory processes." DOI: 10. 1214/10- AOS812. 

Priestley, M. B. (1965). Evolutionary spectra and non-stationary processes. J. Roy. 
Statist. Soc. Ser. B 27 204-237. MR0 199886 

VON Sachs, R. and MacGibbon, B. (2000). Non-parametric curve estimation by wavelet 
thresholding with locally stationary errors. Scand. J. Statist. 27 475-499. MR1795776 

Whittle, P. (1953). Estimation and information in stationary time series. Ark. Mat. 2 
423-434. MR0060797 

Department of Statistics 

Pontificia Universidad Catolica de Chile 

Vicuna Mackenna 4860 

Macul, Santiago 

Chile 

E-MAIL: wilfredo@mat.puc.cl 



